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Description 
INSPECTION SYSTEM AND APPARATUS 

Cross Reference to Related Applications 

[0001] This application is a continuation-in-part and claims pri- 
ority from copending U.S. Patent Application Serial No. 
10/771,628, filed February 3, 2004, which is itself a con- 
tinuation of U.S. Patent Application Serial No. 10/631,469, 
filed July 29, 2003, which claims priority from U.S. Provi- 
sional Patent Application Serial No. 60/444,504, filed 

February 3, 2003. 
Field of the Invention 

[0002] The present invention is directed to methods and systems 
for the inspection of semiconductor wafers and other ma- 
terials such as integrated circuits (IC) and any surface 
benefiting from inspection. Hereinafter, any material sus- 
ceptible of surface inspection by the system herein de- 
scribed contact potential difference imaging device will be 
denoted generally as a "wafer". More particularly, the 
present invention is directed to a method and system for 



the characterization of microscopic and macroscopic de- 
fects through imaging and visualization of the contact po- 
tential difference topology on the wafer surface through 
the use of a non-vibrating contact potential difference 
sensor. 

Background of the Invention 

[0003] The multi-billion dollar global market for semiconductor 
defect management is growing both in absolute terms and 
as a percentage of semiconductor capital equipment in- 
vestment. In general, there are two factors that determine 
the economics of a semiconductor fabrication facility at a 
given utilization level, namely throughput and yield. As 
complex new technologies such as 300 mm wafers, cop- 
per interconnects, and reduced feature (circuit) sizes drive 
the margin of error in fabrication ever lower, new inspec- 
tion technologies are critical to keep yields high and bot- 
tom-line economics attractive. Detection and elimination 
of chemical contamination and other types of defects is a 
constant concern for semiconductor manufacturers and 
equipment suppliers. Contamination can arise from use of 
processing chemicals, processing equipment, and poor 
handling techniques. Contaminants can include, for ex- 
ample, metals, carbon, and organic compounds. Other 



types of defects can result from a wide range of causes, 
including flaws in the semiconductor crystal, improper 
processing, improper handling, and defective materials. In 
addition, many cleaning steps are required in wafer fabri- 
cation, such as but not limited to the semiconductor in- 
dustry. Each step is time consuming and requires expen- 
sive chemicals that may require special disposal proce- 
dures. Existing methods for monitoring or controlling 
these processes are expensive and time consuming. As a 
result, wafers are often cleaned for a longer period of time 
and using more chemicals than are required. 
[0004] Defect detection and characterization systems can be di- 
vided into in-line and off-line systems. "In-line" refers to 
inspection and measurement that takes place inside the 
clean room where wafers are processed. "Off-line" refers 
to analysis that takes place outside of the wafer process- 
ing clean room, often in a laboratory or separate clean 
room that is located some distance from the manufactur- 
ing area. In addition, many of these analytical techniques 
are destructive, which requires either the sacrifice of a 
production wafer or the use of expensive "monitor" wafers 
for analysis. In-line inspection and measurement is crucial 
for rapidly identifying and correcting problems that may 



occur periodically in the manufacturing process. A typical 
semiconductor wafer can undergo over 500 individual 
process steps and require weeks to complete. Each semi- 
conductor wafer can have a finished product value of up 
to $100,000. Because the number of steps and period of 
time involved in wafer fabrication are so large substantial 
work in process can exist at any point in time. It is critical 
that process-related defects be found and corrected im- 
mediately before a large number (and dollar value) of 
wafers are affected. Such defects, regardless of the nature 
of the wafer, semiconductor, IC, or other device, are detri- 
mental to performance and diminish productivity and 
profitability. 

[0005] Many types of defects and contamination are not de- 
tectable using existing in-line tools, and these are typi- 
cally detected and analyzed using expensive and time- 
consuming "off line" techniques (described below) such as 
Total Reflectance X-ray Fluorescence (TXRF), Vapor Phase 
Decomposition Inductively Coupled Plasma-Mass Spec- 
trometry (VPD ICP-MS) or Secondary Ion Mass Spectrome- 
try (SIMS). Since these techniques are used off-line 
(outside of the clean room used to process wafers) and 
usually occur hours, or even days, after the process step 



that has caused the contamination, their value is signifi- 
cantly limited. 

[0006] a brief description of some well known techniques for 
wafer inspection and chemical contamination detection 
are presented in Table 1. This list is not in any sense ex- 
haustive as there are a very large number of techniques 
that are used for some type of semiconductor analysis or 
characterization or for other surface inspection of other 
types of materials. 
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Analytical Technique 


Description 


In-line/Off-line 


Total Reflection X-Ray 
Fluorescence (TXRF) 


X-rays irradiate the wafer within the 
critical angle for total external reflectance, 
causing surface atoms to fluoresce. 


Off-line 


Automated Optica! Microscopy- 


Optical images are acquired and 
automatically analyzed for detection of 
large defects. 


In-line 


Laser Backscattering 


Wafer surface is illuminated with laser 
spots and the angle and/or polarization of 
reflected tight is analyzed to detect and 
classify particles. 


In-line 


Vapor Phase- Decomposition 
Inductively Coupled-Mass 
Spectrometry (VPD TCP-MS) 


Wafers "scanned"' with a drop of HF that is 
analyzed using mass spectrometry. 


Off-Fine 


Secondary Ion Mass 
Spectroscopy (SIMS) 


Ion beam spatters the wafer surface 
creating secondary ions that are analyzed 
in a mass spectrometer. 


Off-line 



[0007] Table 2 summarizes some major advantages and disad- 
vantages of each example technique. In general, off-line 
detection techniques are extremely sensitive to tiny 
amounts of contamination; but such techniques are slow, 



expensive, and complex to operate. Some have limited, or 
no, imaging or surface mapping capability, or are destruc- 
tive in nature. In-line techniques are much faster, non- 
destructive, and provide defect mapping, but have limited 
chemical contamination detection or analysis capability. 



Table 2 



I Analytical Technique 


Advantages 


Disadvantages 


Total Reflection X-Ray 
Fi uorescence (TXRF) 


* Very sensitive 

* Some mapping capability 

* Nondestructive 


* Limited coverage 

* Unpattemed wafers only 


Automated Optical 
Microscopy 


* Fast 

* Relatively low cost 

* Delects a wiflc* nrny** nl'm^rn 

defects (>50 microns) 

* Imaging of wafer surface 

* Non-contact/non-destructive 


* Very limited chemical and 
particle detection 


Laser Bfiekscatterirag 


* Fast 

* Relatively low cost 

* Detects very small particles 

* Imaging of water surface 

* Non-eontaet'non-desiructive 


* Onfv delects oaritcles — no 
chemistry 


Vapor Phase 
Decomposition 
Inductively Coupled- 
Mass Spectrometry 
(VPD ICP-MS) 

j 


* Very sensitive 

* Able to identity wide range of 
contaminants 


* Destructive 

* Slow 

* Expensive 

* Complex 

* Cannot image 

* Only works on bare silicon 


SeeotKiary Ion Mass 
Spectroscopy (SIMS) 


* Very sensitive 

* Detects a wide range of 
contaminants 

* Sub-surface detection 


* Expensive 

* Slow 

* Destructive 



[0008] | n general, existing in-line wafer inspection tools operate 
at production speeds and generate images of the wafer 
surface that are processed to identify and locate defects. 
However, these techniques are, as mentioned above, very 
limited in their ability to detect chemical contamination. 



Laser backscattering systems are limited to detecting par- 
ticles down to sub-micron sizes, and optical microscopy 
systems can only detect chemical contamination that re- 
sults in a visible stain or residue. Both techniques lack the 
ability to identify or classify the chemical composition of 
the particle or contamination. Off-line laboratory tech- 
niques are used to qualify the cleanliness of new pro- 
cesses and equipment, or to analyze defects detected by 
in-line equipment or as part of failure analysis. 
[0009] Another system that has been investigated is the use of 
Contact Potential Difference imaging (CPD). CPD refers to 
the electrical contact between two different metals and 
the electrical field that develops as a result of the differ- 
ences in their respective maximum electronic energy level, 
i.e. their respective Fermi energies. When two metals are 
placed in contact, the Fermi energies of each will equili- 
brate by the flow of electrons from the metal with the 
lower Fermi energy to that of the higher. "Vibrating CPD 
sensor" refers to the vibration of one metal relative to the 
other in a parallel plate capacitor system. The vibration 
induces changes in the capacitance with time, and there- 
fore a signal related with the surface profile. A CPD signal 
can also be generated by the translation of one surface 



past a reference sample through the use of a non-vi- 
brating contact potential difference (nvCPD) sensor(s). 
This translation makes high speed scanning possible. 

[0010] However, even these nvCPD sensors can themselves 
present certain difficulties. At a microscopic level, the 
surfaces of wafers are not flat due to wafer thickness vari- 
ation, materials on the surface, "bowing", and other fac- 
tors. In order to scan the wafer at a close but safe (i.e., 
close to the surface to promote good signal strength but 
far enough away to minimize any possibility of impacting 
the wafer surface) distance, an appropriate sensor height 
must be calculated and set. Thus, the height of the sensor 
above the wafer surface must be measured and controlled 
to produce repeatable results. Furthermore, height control 
is also necessary to minimize the sensor height to im- 
prove resolution and signal strength. However, height is 
difficult to control and measure, as is the appropriate 
height for measurements on each specific wafer. 

[° 01 1 ] It is possible to use one of many commercially available 
height sensors to control the height of the nvCPD sensor 
above the wafer surface. This requires the expense of an 
additional sensor, and the added complexity of a calibra- 
tion routine to determine the position of the nvCPD sensor 



tip relative to measurements made by the separate height 
sensor. 

[0012] a related problem is the difficulty in establishing a point 
of reference for all distance measurements, including 
height, related to an nvCPD scan. A reference point is 
needed to produce useful measurement data for image 
production. 

[0013] | n some sensor systems, such as nvCPD sensors, it is nec- 
essary to separate the sharp peak signal from the other 
two components of the signal (low frequency signal and 
induced noise signals) to locate and measure the contam- 
inated areas of a wafer. This is challenging because the 
sharp peak signal behaves like noise, i.e., it consists of 
sharp peaks that alternate their polarity in high frequency 
mode. Because of this, conventional high frequency filters 
based only on the frequency domain do not work, as they 
would degrade the sharp peak signal significantly along 
with the noise. 

[0014] | n addition, an nvCPD signal is generally delayed in time, 
which impacts on the quality of the nvCPD signal/image. 
As the sampling time increases, the time delay becomes 
larger. The time delay may be explained by the equivalent 
RC circuit modeling the electrical signal path from the 



probe tip to the output of the A/D converter through the 
amplifier, the data acquisition board and the connecting 
lines between them. The equivalent capacitance is mixed 
with the capacitance between the probe and the wafer 
surface, the parasitic capacitance of the connecting lines, 
the internal capacitance of the amplifier, and other known 
conventional effects. The result is that minute feature sig- 
nals are less detectable, and the signal magnitude and 
thus the signal-to-noise ratio are smaller. 

[0015] Furthermore, topographical features of a wafer often pro- 
duce a weak signal in comparison to the signals from 
chemical features. As the usefulness of topographical ver- 
sus chemical features often varies depending on the par- 
ticular circumstances of an imaging application, there ex- 
ists a need to be able to amplify the signal indicating to- 
pographical features or to separate, superimpose, reduce 
or remove signal indicating chemical features. 

[0016] Also, many different types of imaging systems currently 
rely on a chuck to spin a sample material, such a semi- 
conductor wafer, relative to the probe apparatus. These 
current designs scan the sample surface at a constant ro- 
tational speed. The probe then scans the wafer by taking 
circumferential tracks of data at a constant sampling rate. 



Due to constant rotational speed and a constant sampling 
rate, it is apparent that the angular separation of an indi- 
vidual sample will be constant over the wafer surface. 
However, the actual physical spacing of the data in Carte- 
sian coordinates varies with the radius of the track being 
scanned. In effect the data becomes denser as the radius 
decreases. In addition the amount of current generated in 
the sensor is a linear relationship with the relative speed 
of the probe to sample. The actual relative speed of the 
sample to the probe is then related to the radius of the 
track of data being collected so it is not a constant when 
the sample is scanned at a constant rotational speed. This 
results in signals of larger value on the outer radius of the 
sample and lowers the signal towards the center of the 
sample which results in higher signal to noise ratio than if 
data density were maintained at a substantially constant 
level. 

[0017] | n addition, a need exists to increase the overall accuracy, 
speed, and efficiency of current inspection systems. Cur- 
rent systems do not meet the increasing demand from the 
industry to provide a method of testing a wider variety of 
products in a more efficient and faster manner. 

[0018] a critical need therefore exists for a fast, inexpensive, and 



effective means of detecting, locating, and classifying rel- 
atively small quantities of chemical content or features 
and physical features on wafers. In addition, there is a 
need for a system which minimizes cost and complexity of 
the sensor control mechanisms, such as height control. 
Furthermore, there is a need for methods and systems 

that have improved signal processing. 
Summary of the Invention 

[0019] The present invention provides an inspection system that 
is a fast, inexpensive, and effective means of detecting, 
locating, and classifying relatively small quantities of 
chemical content, and physical features on materials, such 
as but not limited to semiconductor wafers, integrated 
circuit devices, liquid crystal display panels, or any mate- 
rial which may benefit from such inspections, while allow- 
ing for a minimization of the complexity of the sensor 
control mechanisms and an improvement in signal pro- 
cessing. In one exemplary embodiment, a wafer inspec- 
tion system of the present invention includes steps for 
identifying a defect on a surface of a semiconductor 
wafer. In an exemplary embodiment, the steps comprise 
providing a semiconductor wafer; providing a non- 
vibrating contact potential difference sensor; scanning the 



semiconductor wafer relative to the non-vibrating contact 
potential difference sensor; generating contact potential 
difference data from the non-vibrating sensor; and pro- 
cessing the non-vibrating contact potential difference 
sensor data to automatically detect a pattern that is char- 
acteristic of a particular type of defect. A system in accor- 
dance with the principles of the present invention involves 
the relative motion of a probe and a testing material. In 
one embodiment, the probe is substantially stationary 
while the testing material is moved relative thereto, such 
as by spinning about an axis parallel with a sensing axis. 
In another embodiment, the probe is moved, such as by a 
rotational method or in a raster-type motion, relative to 
the substantially stationary testing material. 
[0020] | n addition, the system of the present invention provides, 
in an exemplary embodiment, a method for determining a 
reference point for the sensor. In addition, in some em- 
bodiments of the present invention the system includes a 
method for determining the height of the sensor. In addi- 
tion, the present invention may preferably include a 
method for calculating the scan height to allow for static 
or dynamic wafer height variation. Furthermore, a system 
in accordance with the principles of the present invention 



preferably includes signal processing methods and de- 
vices for improving the native signal output of the sensor, 
such as by reducing noise, reducing signal time delay or 
applying various pattern recognition methodologies. 

[0021] | n another exemplary embodiment, the present invention 
relates to a method of biasing the system to provide for a 
stronger topographical signal in comparison to the chem- 
ical feature signal. 

[0022] | n another exemplary embodiment, the system of the 

present invention includes a mechanism to allow for sub- 
stantially uniform data density. In one embodiment, the 
system includes a variable speed chuck that is able to 
compensate for the motion of the probe relative to the 
spinning surface of the material being inspected; thus al- 
lowing the data captured to have a substantially constant 
density. 

[0023] | n another exemplary embodiment, the system of the 
present invention comprises a plurality of probes. The 
plurality of probes may be configured to allow for differ- 
ent inspection scenarios, such as but not limited to using 

a linear or two-dimensional array. 
Brief Description of Drawings 

[0024] FIG. 1 illustrates one embodiment of the nvCPD scanning 



method and system; 

[0025] FIG. 2 illustrates the concept of the contact potential dif- 
ference methodology; 

[0026] FIG. 3 illustrates an nvCPD scanning method; 

[0027] FIG. 4 illustrates the current output of an nvCPD probe as 
it passes over a positive and negative work function tran- 
sition; 

[0028] FIG. 5 illustrates axial orientation of the nvCPD system; 

[0029] FIG. 6 illustrates standard deviation of signals within a 
scan area; 

[0030] FIG. 7 illustrates steps for creating test wafers which are 
partially coated with known concentrations of contami- 
nants; 

[0031] FIG. 8A illustrates one form of scanning nvCPD system 

with a three axis linear positioning system with the nvCPD 
sensor and a wafer mounted on a high speed spindle; and 
FIG. 8B illustrates another form of scanning nvCPD sys- 
tem; 

[0032] FIG. 9 illustrates a flow diagram for the image acquisition 
process of a radially scanned nvCPD imaging system; 

[0033] FIG. 10A illustrates an optical image of a 100mm diameter 
silicon wafer after application of a vacuum pick-up device 



and FIG. 10B illustrates an nvCPD image of the wafer of 
FIG. 10A; 

[0034] FIG. 11A illustrates an optical image of a second silicon 
wafer after applying alcohol while spinning the wafer and 
allowing the alcohol to dry and FIG. 11B is an nvCPD im- 
age of the same wafer of FIG. 11A; 

[0035] FIG 12A illustrates an optical image of a silicon wafer after 
application of a latex glove mark and FIG. 12B is an nvCPD 
image of the same wafer of FIG. 12A; 

[0036] FIG. 13A illustrates an optical image of a silicon wafer 
having human fingerprints on the wafer and FIG. 13B il- 
lustrates an nvCPD image of the wafer of FIG. 13A; 

[0037] FIG. 14 illustrates an nvCPD image of a silicon wafer after 
brushing the wafer surface with a stainless steel tool; 

[0038] FIG. 15 illustrates an nvCPD image of a silicon wafer after 
pressing an aluminum fixture onto the wafer surface; 

[0039] FIG. 16 illustrates a chart depicting a typical nvCPD signal 
where there is a set of peaks comprising a positive peak 
and a negative peak having non-equivalent heights; 

[0040] FIG. 17 illustrates a chart depicting a signal output of one 
embodiment of the present invention where the positive 
peak height is substantially equivalent to the negative 
peak height; 



[0041] FIG. 18 is a detailed view of the Adjust Starting Position 
and Height of Probe Above Surface step of FIG. 9; 

[0042] FIG. 19 illustrates NCVPD processed wafer images before 
deconvolution; 

[0043] FIG. 20 illustrates NCVPD processed wafer images after 
deconvolution; 

[0044] FIG. 21A illustrates a wafer map produced in accordance 
with the principles of the present invention, wherein the 
wafer pattern is one atomic layer thick over native silicon 
oxide;FIG 21B is a graph showing signal strength along a 
single probe track; FIG. 21C is a graph of the signal 
strength versus the density of gold for the wafer map de- 
picted in FIG. 21A; 

[0045] FIG 22A is a 2D Edge Detection optical view using Canny 
Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry);FIG 22B is a 2D Edge Detection image pro- 
duced in accordance with the principles of the present in- 
vention using Canny Algorithm at Multiple Resolutions (#7 
Wafer dipped into a CMP Slurry, threshold=0. 00001, Con- 
tamination Level=24.5);FIG 22C is a 2D Edge Detection 
image produced in accordance with the principles of the 
present invention using Canny Algorithm at Multiple Res- 
olutions (#7 Wafer dipped into a CMP Slurry, threshold 



=0.008, Contamination Level =4.5);FIG 22D is a 2D Edge 
Detection image produced in accordance with the princi- 
ples of the present invention using Canny Algorithm at 
Multiple Resolutions (#7 Wafer dipped into a CMP Slurry, 
threshold=0.01, Contamination Level =1.9);FIG 22E is a 
2D Edge Detection image produced in accordance with the 
principles of the present invention using Canny Algorithm 
at Multiple Resolutions (#7 Wafer dipped into a CMP 
Slurry, threshold=0.012, Contamination Level =1.1);FIG 
22F is a 2D Edge Detection image produced in accordance 
with the principles of the present invention using Canny 
Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold=0.014, Contamination Level =0.8); 
[0046] FIG. 23A an optical image of 2D Edge Detection produced 
in accordance with the principles of the present invention 
using Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry);FIG. 23B an optical image of 2D 
Edge Detection produced in accordance with the principles 
of the present invention using Canny Algorithm at Differ- 
ent Scales (Qcept #6 Wafer dipped into a CMP Slurry, 
threshold=0. 00001, Contamination Level =24.3);FIG. 23C 
an optical image of 2D Edge Detection produced in accor- 
dance with the principles of the present invention using 



Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry, threshold=0.005, Contamina- 
tion Level =9.6);FIG. 23D an optical image of 2D Edge De- 
tection produced in accordance with the principles of the 
present invention using Canny Algorithm at Different 
Scales (Qcept #6 Wafer dipped into a CMP Slurry, thresh- 
old^. 006, Contamination Level =8.2);FIG. 23E an optical 
image of 2D Edge Detection produced in accordance with 
the principles of the present invention using Canny Algo- 
rithm at Different Scales (Qcept #6 Wafer dipped into a 
CMP Slurry, threshold=0.008, Contamination Level =6.9); 
FIG. 23F an optical image of 2D Edge Detection produced 
in accordance with the principles of the present invention 
using Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry, threshold=0.009, Contamina- 
tion Level =6.4); 
[0047] FIG. 24A shows an image of a semiconductor wafer gen- 
erated by an apparatus of the present invention with no 
bias voltage; FIG. 24B shows the image of the same semi- 
conductor wafer where a 9 volt bias is applied; FIG. 24C 
shows the image generated by an apparatus of the 
present invention, where the bias signal has been elimi- 
nated as shown mathematically below; FIG. 24D illustrates 



the composition of the semiconductor wafer inspected in 
FIGS. 24A-C; 

[0048] FIG. 25A illustrates an apparatus for calibration according 
a calibration process of the present invention for calibrat- 
ing the height of the nvCPD sensor; FIG. 25B shows the 
height sensor positioned above a reference surface so that 
the distance between the reference surface and the height 
sensor is within the range of detection for the height sen- 
sor; FIG. 25C shows the nvCPD sensor moved slowly down 
while the level of the nvCPD signal is monitored; FIG. 25D 
shows that if the desired height is h*, then the height 
sensor is positioned above the surface so that the surface 
is within the measurement range of the height sensor; 
FIG. 25E illustrates that the nvCPD sensor is height ad- 
justed to z*=z3-(h3-hl)-(zl-z2)+h*, which results in a 
height of the nvCPD sensor probe tip above the surface of 
h*. 

Detailed Description 

[0049] a preferred embodiment of the invention is directed to an 
improved use of an nvCPD sensor. In particular, FIG. 1 il- 
lustrates a functional block flow diagram of components 
and operation of one preferred form of an nvCPD scan- 
ning system 10. A nvCPD sensor 12 (see FIG. 2) is based 



on the phenomena of contact potential difference which is 
a voltage generated between two dissimilar materials 
brought in close proximity to each other. An illustration of 
this concept can be seen in FIG. 2. In the case of the wafer 
scanning system 10, the sensor tip 13 forms a first plate 
14 and a wafer 15 having a wafer surface 16 forms a sec- 
ond plate 18 (see FIG. 2.) Probe tip surface 20 of the first 
plate 14 is made of a conducting material with a fixed 
work function - generally, the difference in energy be- 
tween the Fermi level of the solid and the free energy of 
the space outside the solid, including, in metals, the im- 
age potential of electrons just outside the surface. The 
wafer surface 16 of the second plate 18 has a work func- 
tion which can vary due to irregularities in the semicon- 
ductor wafer surface 16 or contaminants or other materi- 
als deposited on the wafer surface 16. When the first plate 
14 and the second plate 18 are electrically connected, the 
Fermi levels of the respective surface equilibrate and form 
an electric field between them. If the work function of the 
sensor tip 13 is fixed, the magnitude of the electric field 
is then related to the distance between the first plate 14 
and the second plate 18, the relative dielectric between 
the first plate 14 and the second plate 18 and the work 



function of the wafer surface 16. In practice the first plate 
14 and the second plate 18 equilibrate rapidly providing 
little to measure. To provide a current flow that can be 
measured, some motion of the sensor tip 12 relative to 
the wafer surface 16 must be realized. In one embodi- 
ment, the nvCPD sensor 12 is moved over the surface at a 
substantially fixed distance and variations in the wafer 
surface 16 cause a current to flow. 
[0050] An illustration of this concept can be seen in FIG. 3. The 
current flow from this nvCPD sensor 12 can be modeled 
by the following equation: 

PV AC 

I ~ C 4- v 

at at 



[0051] wherein C and V are defined as 

&«✓/:$. , t rr " probe wtTfvr 
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[0052] anc | further wherein c q is the permittivity of free space, 
is the relative dielectric constant, A is the area of the 
probe tip, d is the distance between the sensor tip 13 and 



the wafer 15, <t> is the work function of the respective sur- 
face, and e is the charge on an electron. The V term can 
also be described as a difference in surface potentials be- 
tween the nvCPD sensor 12 and the wafer 15. In addition 
the surface potentials on the wafer surface 16 can vary 
due to defects. The overall surface potential is related to 
the underlying materials work function, but it can also be 
affected by adsorbed layers of material on the wafer sur- 
face 16. Even sub mono-layers of materials are known to 
significantly affect the surface potential. 
[0053] The 

dV ^ <frprobe ~ <S>wafer 
dt Al 

term is related to changes in work function on the wafer 
surface 16. It can be seen that the magnitude of this term 
is related to the relative changes in work function on the 
wafer surface 16 and relative speed at which the nvCPD 
sensor 12 is moved over the wafer surface 16. An illustra- 
tion of the signal generated from this can be seen in FIG. 
4. Thus, a system in accordance with the principles of the 
present invention is capable of generating one- 



dimensional signals and two-dimensional images, al- 
though three-dimensional images can be generated. 
[0054] Many defects can present themselves as variations in the 
wafer work function or the overall surface potential. Both 
chemical and physical (i.e., geographical) features of the 
wafer surface and the underlying materials can affect the 
work function of a particular portion or even a single point 
on the wafer surface; thus, these features can be detected 
by a sensor in accordance with the principles of the 
present invention. For instance, variation in semiconduc- 
tor dopant concentrations in the wafer 15 will cause vary- 
ing characteristic work functions. In addition, other mate- 
rials that could diffuse into the wafer 15, such as but not 
limited to copper, will cause variations in work function. 
Within the semiconductor material (or any other material 
susceptible to measurement) itself, mechanical phenom- 
ena such as dislocation pile-ups, cracks, and scratches 
generate local stresses which will change the local work 
function. In addition, adsorbed layers of atomic or molec- 
ular contaminants even at the sub monolayer level will 
generate appreciable surface potential variations. Particles 
deposited on the wafer 16 with a surface potential differ- 
ent than the surrounding wafer material will also create a 



signal. Layers of chemicals commonly used in the wafer 
fabrication process will affect the surface potential of the 
wafer. For instance residual CMP slurry or photo-resist 
would cause local variations in surface potential de- 
tectable by the nvCPD sensor 12 of the present invention. 
Such defects and chemistry have associated with them 
characteristic signatures which enable inspection of the 
wafer surface. 

[0055] T he 




term is related to changes in gap between the nvCPD sen- 
sor 12 and the wafer 15 or variations in the relative di- 
electric constant. Geometrical imperfections in the wafer 
surface 16 or particles on the wafer surface 16 would 
manifest themselves in this component. Also because of 



its differential nature, the magnitude of this component 
would also increase as the relative speed of the nvCPD 
sensor 12 to the wafer 15 is increased. 

[0056] As previously mentioned, physical or geographical aspects 
and defects can be imaged using a system in accordance 
with the principles of the present invention. Many classes 
of wafer defects would appear as geometrical changes in 
the wafer surface 16. In the wafer 15 itself, surface cracks, 
scratches and etched trenches would be nonlimiting ex- 
amples of such defects causing a geometrical change in 
the wafer surface and an attendant change in the work 
function. In addition, particles deposited on the wafer 15 
would also present themselves as a local change in the 
distance to the probe sensor tip 13. 

[0057] Variations of dielectric films on the wafer 15 can also be 
detected. An example would be detecting variations in the 
oxide state grown on the silicon substrate (i.e. SiO, Si0 2 , 
SiO , SiO ). In addition, variations in dielectric of other 

3 4 

non-conducting materials commonly deposited on the 
wafer could be detected. 
[0058] it should also be noted that many features could present 
themselves as combinations of geometrical changes and 
chemical changes. For instance, a particle deposited on 



the wafer 15 of differing material than the underlying 
wafer 15 could cause variation. Also, a crack in the sur- 
face would also induce stresses that would cause varia- 
tions in local work function. 

[0059] | n FIG. 5 is schematically shown one form of the system 
10 for application of the nvCPD sensor 12 to scan the 
wafer 15 for defects and contamination. FIGS. 8A and 8B 
also illustrate more detailed drawings of two alternative 
operating embodiments of the system 10. The system 10 
in FIG. 5 includes an X-Y-Z positioning system 26, a ro- 
tating wafer stage 28, a high speed data acquisition sys- 
tem 30 with a personal computer (PC) 32, and control 
software executed by the PC 32. 

[0060] As shown in more detail in FIG. 8A, in one embodiment, 
the wafer 15 is affixed to a rotating spindle or chuck 54 
(see FIG. 1) using a clamping fixture 56 on the wafer 
edges. A sensor positioning system 50 includes an nvCPD 
sensor 52 positioned a fixed distance from the wafer 15 is 
mounted to a spindle 54. The wafer 15 (not seen in this 
view) is then rotated at high speed, and the nvCPD sensor 
52 is translated radially to collect data in circumferential 
tracks. The scanning procedure as shown schematically in 
FIG. 9 lasts between a few seconds and several minutes, 



depending on the number of scanned tracks, the speed of 
the spindle 54, and the speed of the sensor positioning 
system 52. The tracks of data are then put together to 
form a CPD image. These CPD images allow the visualiza- 
tion of chemical and geometrical defects and thereby en- 
able classification of the type of defect present on the 
wafer surface. Some examples of these CPD images can 
be seen in FIG. 10A-15 and are taken from a 100mm 
wafer compared with optical images of the same wafer 
(see, Example infra). The present invention is capable of 
generating image maps of one atomic layer thick patterns, 
as shown in FIG. 21A. FIG. 21B illustrates the signal 
strength as the wafer is rotated relative to the probe, thus 
passing over defects and features of the wafer surface. As 
shown in FIG. 21C, the present invention, in fact, detected 
sputtered gold at densities less than a single complete 
atomic layer. 

[0061] The images generated by the scanning procedure of FIG. 9 
were subsequently processed to automatically locate de- 
fects; thus locating areas of high variability. An ideal sur- 
face would exhibit a flat signal, but a wafer surface with 
defects would exhibit some variability in the signal. To lo- 
cate areas with defects, the data was broken up into small 



areas of known location. The standard deviation of the 
signal within these areas was determined. Areas with de- 
fects showed a higher standard deviation, and these re- 
sults can be seen in FIG. 6. Areas with defects appear 
brighter than lower variability areas of the wafer 15. This 
is one of many possible methods in accordance with the 
principles of the present invention to process the sensor 
data. 

[0062] M 0re generally, a defect can be identified by one or more 
of the following methods:- Process the data to look for a 
voltage or change in voltage (or pattern of voltages or 
changes in voltages) that exceeds some user-defined 
value (threshold). 

[0063] _ Compare the data to a known pattern that represents a 
defect via some form of correlation or template matching. 

[0064] _ Convert the spatial data to the frequency domain and 
then identify peaks in the frequency domain that repre- 
sent defects with unique spatial characteristics. 

[0065] These techniques can also be combined with other tech- 
niques to yield analytical results. The signal may also be 
preprocessed to facilitate defect detection, such as, for 
example: 

[0066] _ since the signal is differential, it can be integrated over 



some distance to produce voltages that represent relative 
CPD's over the surface of the wafer 15. 

[0067] _ |f the wafer 15 is "patterned", then this known pattern 
can be removed from the data prior to processing. This 
would likely be accomplished through some conventional 
method of variation of image or signal subtraction in ei- 
ther the space or frequency domains. 

[0068] _ The signal would likely be processed with some form of 
frequency filtering to remove high or low frequencies de- 
pending on the size, shape and other characteristics of 
the expected defects. 

[0069] _ The signal could be processed to remove features of a 
certain size by doing what is called "morphological pro- 
cessing" which is by itself well known in other applica- 
tions. 

[0070] | n one embodiment, a defect is detected and the contami- 
nation level is quantified based on an edge detection al- 
gorithm, such as but not limited to a Canny Edge detec- 
tion algorithm. Multiple resolutions may be used or multi- 
ple scales or a combination thereof. FIGS. 22B-F depict 
the edge detection at various resolutions and is shown in 
comparison to an optical image (FIG. 22A). FIGS. 23B-F 
depict edge detection at various scales and in comparison 



to an optical image (FIG. 23A). In a preferred example of 
such an embodiment, the contamination is detected and 
quantified using the steps of: 

[0071] _ Generating a CPD sensor peak signal at the boundary 
between two different areas (The peak signals behave 
much like the "edges" , an image processing term. So, the 
contaminated area can be located by edge detection.); 

[0072] _ Apply an Edge detection algorithm (such as the 2D 
Canny algorithm);- Multiple resolutions with different 
thresholds (thereby enabling detection of various size of 
contaminants, i.e. the higher resolution (lower threshold) 
will find the smaller contaminants); 

[0073] _ Quantifying contamination level (CL) by the edge area 
over the total wafer area in the simplest way. 

[0074] As previously discussed, determining a reference point for 
the sensor is necessary for optimal results. In one embod- 
iment, the reference point is at the center of rotation (in 
the X-Y plane) and at the height of the surface of the 
wafer (on the Z axis). To find this point, the center of ro- 
tation and the height of the surface of the wafer must be 
determined, and then the height sensor is correlated with 
the Z position of the nvCPD sensor. 

[0075] jo fj nc | the center of rotation, the nvCPD sensor and mo- 



tion system are used to find a geometrical and/or chemi- 
cal feature on the surface of the spinning wafer at three or 
more points. Since the wafer is spinning, the feature de- 
scribes a circle. The center of the circle is the center of 
rotation. Given the coordinates of three distinct points 
(A(x i ,y i ), B(x 2 ,y 2 ), and C (x,y)) on the diameter of the de- 
fined circle on the circle, its center is found algebraically 
by the equation:(x -x^x -x^ + (y -y^Cy -y ) = 0. 

[0076] D ue t 0 slight measurement errors, a different set of points 
might yield slightly different center coordinates. The 
"true" center of rotation is deemed to be the locus 
(average) of these points. 

[0077] | n one embodiment, to find the height of the surface of 

the wafer without touching the wafer surface, two sensors, 
the nvCPD sensor and a height sensor (which could itself 
be an nvCPD sensor in an embodiment discussed below) 
can be used. The nvCPD sensor and height sensor are cal- 
ibrated so that when a reading is taken with the height 
sensor, the Z-axis coordinate of the tip of the nvCPD sen- 
sor is ascertained. (This calibration procedure is described 
below.) At that point, the readings of the height sensor 
are correlated with the Z position of the nvCPD sensor. 
Thereafter, the height sensor is used to detect the posi- 



tion of the surface of the wafer without touching it, and 
then the tip of the nvCPD sensor positioned accordingly. 
[0078] | n one embodiment, the height sensor is correlated with 
the Z position of the nvCPD sensor based on two assump- 
tions: first, that within its usable range, measurements 
from the height sensor are linear in the Z axis and that a 
constant, k, can map changes in height measurements to 
proportional changes in Z; and second, that the relative 
positions of the height sensor and nvCPD sensor are 
fixed, i.e. the two sensors can move relative to the rest of 
the world but only as a unit; they, therefore, cannot move 
independently. Based on these assumptions, a point, P, is 
picked in the X-Y plane where calibration is to be per- 
formed. The height sensor is positioned above P, and a 
measurement from the height sensor, Hm, correlated with 
a coordinate on the Z axis, Z . Next the nvCPD sensor is 

h 

positioned above P and move it down until it touches at a 
point, Z^. The nvCPD signal changes significantly when the 
sensor tip touches the surface. Once these values are 
known, the Z value of the point where the tip of the nvCPD 
sensor would touch the surface is derived with the follow- 
ing equation: 

^surface ™ Zcunciit 1 Z c " + (H m " Hcmteitf } ' k 



[0079] wherein: 

[0080] z is the height of the surface where the tip of the 

surface 

nvCPD sensor would touch 
[0081] z is the current height of the sensor 

current 

[0082] h j S the current height sensor measurement 

current 

[0083] As previously mentioned, the height of the sensor should 
be measured and controlled to produce repeatable results. 
It is also possible to use an nvCPD sensor to control the 
height in a semiconductor wafer inspection system in ac- 
cordance with the principles of the present invention. In 
order to use the nvCPD sensor to control height, the sys- 
tem must provide the capability to apply a time-varying 
bias voltage between the probe tip and the wafer surface. 
As the bias voltage varies, it produces an output signal 
that is a function of the capacitance between the probe tip 
and the wafer surface. The closer the probe tip is to the 
surface, the larger the output voltage. After the relation- 
ship between height and capacitance is determined, the 
magnitude of the output signal can be used to calculate 
the height of the sensor. The signal magnitude can be 
calculated as the peak-to-peak, standard deviation, RMS, 
or some other measure known in the art. 



[0084] Again, the formula for the output of the nvCPD sensor is: 




[0085] The voltage V is the contact potential difference between 
the probe tip and the wafer surface. If a bias voltage is 
applied, the formula then becomes: 



[0086] where Vb is the bias voltage. If the nvCPD sensor is not 
moving relative to the surface of the wafer (or is moving 
relatively slowly), then the capacitance C and the contact 
potential difference voltage V are not changing, and the 
equation becomes: 




[0087] since the bias voltage is a known fixed frequency and 

magnitude, the output current is a function of the capaci- 
tance (C). C is a combination of the capacitance between 
the probe tip and wafer surface, and any stray capaci- 
tances in the circuit. The capacitance vs. height function 
can be characterized and used to determine the height of 
the nvCPD probe at a point above the wafer surface. Once 
the height of the sensor is determined, then the bias volt- 
age can be turned off in order to make scanning nvCPD 
measurements. 

[0088] However, in some embodiments prior to scanning a por- 
tion of the wafer, a height profile is established with a 
height sensor and then the scanning height of the nvCPD 
sensor adjusted appropriately. FIG. 18 depicts one em- 
bodiment which utilizes a height profile of the wafer to 
position the sensor. The height profile is determined by 
first moving the height sensor to the center of rotation 
and then, with the wafer spinning, the height sensor is 
moved out toward the edge of the wafer until it senses the 
edge. Note that this also allows the diameter of the wafer 
to be determined. The sensor is then moved back toward 
the center until it is within the wafer flat(s) or notch. One 
or more height measurements taken along the way estab- 



lish the profile. An appropriate height for nvCPD sensor 
scanning is calculated based on the profile, particularly 
based on the maximum detected height. 

[0089] As mentioned above, often the nvCPD sensor used in ac- 
cordance with the principles of the present invention gen- 
erate a peak signal that behaves like noise. In accordance 
with the principles of the present invention, denoising al- 
gorithms can be applied to both nvCPD signals and nvCPD 
images. In one embodiment, the nvCPD signal/image data 
are decomposed into the wavelet domain using one of the 
wavelets available such as but not limited to 'Coiflet', 
'Daubechies 1 , 'Symmlet 1 , and other such wavelets. Then, as 
a result of the wavelet decomposition, a series of wavelet 
coefficients are obtained at a finite number of scales that 
can be given by the user. A coefficient at a particular scale 
represents the magnitude of the frequency corresponding 
to that scale at the point corresponding to that coefficient. 
The nvCPD signal/image can then be reconstructed by the 
coefficients in reverse order. 

[0090] By adjusting the coefficients and performing reconstruc- 
tion, the three components (peak, low frequency, and 
noise) of the nvCPD signal/image can be selectively fil- 
tered out. To eject the low frequency component from the 



nvCPD signal/image, only wavelet coefficients at fine 
scales are used for reconstruction since the low frequency 
component of the nvCPD signal/image are represented by 
the coefficients at coarse scales. To eject the noise from 
the nvCPD signal/image, the coefficients at fine scales can 
be shrunk softly based on the threshold given. The 
threshold can be determined using any one of numerous 
methods known in the art such as, but not limited to, 
'Visu', 'SURE', 'Hybrid', 'MinMax'. The sharp peak signal 
that is related to contamination on the wafer can be re- 
constructed substantially in isolation by the wavelet coef- 
ficients resulting after the two processes above. Thus, 
noise such as vibrations or a wobbling of the wafer can be 
filtered out of the signal. FIG. 19 depicts an image pro- 
duced by a system in accordance with the principles of the 
present invention without deconvoluting or denoising the 
data. FIG. 20 illustrates the improved resolution and defi- 
nition of an image which is denoised in accordance with 
the principles of the preferred embodiment. 
[0091] a semiconductor wafer inspection system in accordance 
with the principles of the present invention which utilizes 
a nvCPD sensor may, as discussed above, experience a 
time delay. However, the present invention provides a fil- 



tering technique to remove this time delay. First, the time 
delay circuit is modeled as a first order RC circuit. The 
continuous-time transfer function of the RC circuit is 
given by 

X(s) zs + 1 

[0092] where X(s) and Y(s) are the Laplace transformation of the 
input current signal at the probe tip and the output volt- 
age measurement to the data acquisition, and t is the 
time delay constant. 

[0093] The continuous current signal is fed into and amplified by 
the amplifier, and then converted into a discrete signal 
through the A/D converter. In this way, the collected data 
by the computer at the final stage is a series of discrete 
data. For digital signal processing, the continuous-time 
transfer function of the RC circuit is converted into a dis- 
crete-time transfer function based on Z-transformation. 



This discretized transfer function has the form 
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X(z) z + fi 

wherein the constants a and 3 are determined by the dis- 
cretization method employed, the sampling time and the 
time delay constant, t. 

[0094] Next, in a preferred embodiment, the impulse response of 
the discretized transfer function is determined. In general, 
the impulse response is a finite number of positive dis- 
crete values that converges to zero gradually. Once the 
impulse response is found, the deconvolution process 
with the impulse response is performed on each track 
data separately. 

[0095] Time constant prediction is important and can be as- 
sessed by comparing the positive peak height and the 
negative peak height. Figure 16 shows atypical nvCPD 
signal where there is a pair of a positive peak and a nega- 
tive peak. It is shown that the positive peak is higher than 



the negative peak. With zero time delay, the signal would 
look like Figure 17, where the positive peak height is 
equivalent to the negative peak height. 

[0096] By comparing the positive peak height with negative peak 
height, the time constant can be estimated correctly. If the 
time constant is underestimated, the former peak (in this 
example, the positive peak) is higher than the latter peak 
(in this example, the negative peak). If the time constant 
is overestimated, the former peak is lower than the latter 
peak. By varying the time constant, a point when the posi- 
tive and negative peaks are equivalent in height could be 
found to predict the time constant correctly. 

[0097] | n one exemplary embodiment, the present invention pro- 
vides a method and apparatus to allow for the isolation of 
the topographical information by filtering out the chemi- 
cal information from the sensor apparatus. This embodi- 
ment utilizes a direct current (D.C.) bias applied to the 
system. In one exemplary embodiment, the bias is applied 
to the sensor. In another exemplary embodiment, a simi- 
lar bias is applied to the sample surface directly instead of 
to the sensor. A first scan of the sample surface is taken 
with a negative bias applied to either the sample or the 
probe and the data is recorded. A second scan of the 



wafer is taken with a positive bias applied to which ever of 
the probe or the probe had the bias applied during the 
first scan. One of ordinary skill in the art will appreciate 
that this order could be reversed and the positive bias 
scan could be done first followed by a negative bias scan. 
The signal with the negative bias applied is then sub- 
tracted from the signal with the positive bias leaving a 
signal related only to geometric change on the sample 
surface. FIGS. 24A-D illustrate the images of a wafer at 
each of the various steps. FIG. 24A shows an image of a 
semiconductor wafer generated by an apparatus of the 
present invention with no bias voltage. FIG. 24B shows the 
image of the same semiconductor wafer where a 9 volt 
bias is applied. FIG. 24C shows the image generated by an 
apparatus of the present invention, where the bias signal 
has been eliminated as shown mathematically below. FIG. 
24D illustrates the composition of the semiconductor 
wafer inspected in FIGS. 24A-C. As can be seen, the geo- 
metrical or topographical features were strengthened 
while the chemical features were weakened. The basic 
equations relating to nvCPD inspection can be used to il- 
lustrate this embodiment. 
[0098] As previously stated, the basic CPD equation is: 



at ' at 

The equation with positive bias applied is: 

dV dC 
at at 



The equation with negative bias applied is: 

dV dC 



Thus, by subtract the signal with a negative bias from the 
signal with a positive bias results in: 
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Solving the difference shows the signal dependent on only 
geometry changes of surface (represented by capacitance) 



[0099] | n an exemplary embodiment, the present invention re- 
lates to a method and apparatus to allow for the preserva- 
tion of signal to noise ratio and for providing a substan- 
tially uniform data density by varying the rotational speed 
to provide substantially uniform linear speed of the sam- 
ple relative to the probe. A variable speed chuck is pro- 
vided which decreases the rotational velocity in proportion 
with the motion of the probe to provide the probe with 
substantially even data exposure. Thus, the chuck is able 
to compensate for the increasing data per revolution by 
reducing the revolutions per minute to maintain a sub- 
stantially even data density. 

[0100] | n an exemplary embodiment, the system of the present 
invention includes a plurality of probes. The tips of each 
of the plurality or probes may be arranged in a variety of 
different arrangements known in the art, including but not 
limited to linear arrays and two-dimensional arrays. It has 
been shown that multiple probe tips in a variety of config- 



urations, as just discussed, provide the system of the 
present invention with a decrease in the time required to 
scan a sample surface. The decrease in speed is inversely 
proportional to the percentage increase in the number of 
probes used. The individual probes in the plurality have, 
in one exemplary embodiment, varying characteristics. 
Such characteristics may include, but are not limited to, 
bias voltage and height. It is believed that the use of a 
plurality of probes provides, in addition to a decreased in- 
spection time, an improvement in lateral resolution and 
chemical sensitivity. Such an improvement in lateral reso- 
lution and chemical sensitivity may be accomplished, in 
one embodiment, by the use of differentially comparing 
the separate probe data streams such as by a combination 
of separate voltage tracks. 
[0101] | n another exemplary embodiment, a method is provided 
for calibrating the height of the probe tip. This procedure 
assumes that the height sensor and nvCPD sensor are 
rigidly mounted with respect to each other, and that the 
relative height of the sensors (z) can be precisely deter- 
mined. In our system, the height sensor and nvCPD sensor 
are mounted to the same metal fixture, and their relative 
height is determined by reading the z-axis encoder on the 



positioning system. 
[0102] | n one exemplary embodiment illustrated in FIGS. 25A-E, 
the present invention includes a calibration process for 
calibrating the height of the nvCPD sensor. The calibration 
process is as follows. (1) As shown in FIG. 25B, the height 
sensor is positioned above a reference surface so that the 
distance between the reference surface and the height 
sensor is within the range of detection for the height sen- 
sor. The height of the sensors is recorded as zl. The 
height sensor reading is recorded as hi. As shown in FIG. 
25C, the nvCPD sensor is moved slowly down while the 
level of the nvCPD signal is monitored. When the nvCPD 
sensor probe tip contacts the reference surface then the 
output of the nvCPD sensor experiences a significant 
change. This is automatically detected by the scanning 
system and the downward motion stops. The height is 
recorded as z2. The nvCPD sensor probe tip can now be 
positioned at a desired height above any surface. This is 
accomplished as follows. As shown in FIG. 25D, if the de- 
sired height is h*, then the height sensor is positioned 
above the surface so that the surface is within the mea- 
surement range of the height sensor. The height is 
recorded as z3 and the height sensor reading is recorded 



as h3. As shown in FIG. 25E, the nvCPD sensor is then po- 
sitioned above the same point and the height adjusted to 
z*=z3-(h3-hl)-(zl-z2)+h*, which results in a height of 
the nvCPD sensor probe tip above the surface of h*. 

[0103] Although the present invention has been frequently de- 
scribed in relation to the scanning of a semiconductor 
wafer which is spun relative to a probe which takes cir- 
cumferential tracks of data, one skilled in the art would 
appreciate that the present invention is not limited to 
such. For example, the present invention may, in one ex- 
emplary embodiment, be used to scan liquid crystal dis- 
play panels, which are generally to bulky to be spun. In 
this embodiment, the probes are raster-scanned across 
the sample surface. In addition, in another exemplary em- 
bodiment, the probe may be held stationary and the sam- 
ple surface moved relative thereto. In yet another exem- 
plary embodiment, the sample surface may be held sta- 
tionary and the probe may be moved relative thereto. 

[0104] The following non-limiting example describes methods of 
preparation of test wafers and sensing characteristic im- 
ages for identifying certain defect states, chemical states, 
electrostatic states and mechanical features present on a 
semiconductor wafer surface. 



EXAMPLE 



[0105] Sample wafers can be created by dip coating the wafer 15 
in solutions that contain known concentrations of con- 
taminants. Part of this example describes metal contami- 
nants such as Cu and Fe, although any manner of chemi- 
cal contaminants can be evaluated in this way. The wafer 
15 described is either a 100 mm or 150 mm wafer, al- 
though these examples apply to any size wafer. The wafer 
surface 16 is prepared by dipping in HF to remove oxides. 
The wafer 15 is then cleaned and partially dipped in the 
metal contaminant solution. The amount of solution re- 
maining on the wafer 15, and the resulting concentration 
of contaminant on the wafer surface 16, is controlled by 
selecting dip coating parameters such as the extraction 
rate. 

[0106] Partial dipping of the test wafer 15 is preferred to create a 
transition from clean to contaminated areas. Because the 
nvCPD signal is differential, the nvCPD sensor 12 detects 
changes on the wafer surface 16, as opposed to an abso- 
lute value relating to surface condition. This aspect of 
nvCPD sensors 12 is offset by the ability to rapidly image 
and detect localized contamination anywhere on the sur- 
face of the wafer 15. 



[0107] After preparation, each test wafer 15 can be, if necessary, 
analyzed using an appropriate combination of XPS, Auger, 
and RBS (or other well known surface analysis methods) 
techniques to measure actual contaminant concentrations 
in the dipped areas of the wafer 15. Each step involved in 
the sample wafer preparation process is shown in FIG. 7. 
In a production line methodology, standards can be es- 
tablished correlating measure actual contamination con- 
centration to nvCPD data for routine use. 

[0108] After each sample wafer 15 is created, it can be imaged 
using a radially scanning nvCPD imaging system 10 con- 
structed in accordance with the invention. As described 
before, FIGS. 8A and 8B show basic forms of the nvCPD 
imaging system 10, and FIG. 9 shows another flow dia- 
gram illustration of wafer processing. The system 10 em- 
ploys the nvCPD sensor 12 mounted on the previously de- 
scribed three-axis positioning system 26. This positioning 
system 26 is used to position the nvCPD sensor 12 above 
the wafer surface 16 to be imaged, and to scan the nvCPD 
sensor 12 radially across the wafer surface. The wafer 15 
is mounted on a spindle that rotates at high speed (1800 
rpm) beneath the nvCPD sensor 12. The system 10 oper- 
ates by acquiring multiple consecutive tracks of data as 



the nvCPD sensor 12 is stepped along the radius of rota- 
tion of the wafer 15. 
[0109] The imaging system 10 has been used for a variety of sur- 
face analysis experiments. FIGS. 10A, 10B, 11A, and 11B 
show sample wafer images that were generated using the 
nvCPD sensor 12 imaging for wafer inspection. The im- 
ages show optical images in FIG. 10A and 11A and nvCPD 
images in FIG. 10B and 11B of a 100 mm form of the 
wafers 15. The first wafer 15 was cleaned, and then a 
small vacuum pick-up device was attached to the surface 
of the wafer 15 in three locations. The optical image of 
FIG. 10A shows no evidence of any change on the surface 
16 of the wafer 15. The nvCPD image of FIG. 10B shows a 
very large signal at the locations where the pick-up device 
was applied. The nvCPD signal is believed to be the result 
of a small amount of residue left on the surface 16 by the 
pick-up device. 

[0110] The second set of images in FIGS 11A and 11B show a 
wafer 15 that has had alcohol spun-on and then dried. 
The resulting residue is not visible in the optical image 
FIG 11A, but is clearly visible in the nvCPD image FIG. 11B. 
These images provide a clear demonstration of the use- 
fulness of nvCPD sensor 12 for wafer inspection. Through 



careful measure of a full range of defect states and chem- 
ical constituents it is possible to correlate an image with a 
particular chemical state, defect, or combination thereof. 
Figures 12A and 12B show, respectively, an optical image 
of latex glove marks and a nvCPD image of latex glove 
marks. Figures 13Aand 13B show, respectively, an optical 
image of human fingerprints and an nvCPD image of the 
fingerprints. FIG. 14 shows a nvCPD image of a wafer 15 
after brushing the wafer 15 with a stainless steel tool, and 
FIG. 15 shows a nvCPD image of the wafer 15 after press- 
ing an aluminum fixture onto the wafer surface 16. All 
these example images were acquired using the nvCPD 
sensor 12 with the probe sensor tip 14 having a diameter 
of approximately 60 microns measured over a period of 
approximately 30 seconds. 
[0112] while preferred embodiments of the invention have been 
shown and described, it will be clear to those skilled in the 
art that various changes and modifications can be made 
without departing from the invention in its broader as- 
pects as set forth in the claims provided hereinafter. 



